SMALL RATIONAL POINTS ON 
ELLIPTIC CURVES OVER NUMBER FIELDS 
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Abstract. Let E/k be an elliptic curve over a number field. We obtain 
some quantitative refinements of results of Hindry-Silverman, giving an upper 
bound for the number of fc-rational torsion points, and a lower bound for 
the canonical height of non-torsion fc-rational points, in terms of expressions 
depending explicitly on the degree d = [k : Q] of k and the Szpiro ratio a of 
E/k. The bounds exhibit only polynomial dependence on both d and a. 



1. Introduction 

Let A; be a number field of degree d = [k : Q], and let E/k be an elliptic curve. 
In JTj , Merel used deep facts about the arithmetic of modular curves to prove that 
there is a universal bound C{d) depending only on d such that -E(fc)t or | < C{d). 
Quantitative refinements of Merel's theorem due to Parent an d Oesterle (cf. 
0) give bounds for C(d) which depend exponentially on d, but it is still unknown 
whether one can bound C(d) by an expression whose growth is polynomial in d. 
Such a bound-or a proof that no such bound is possible-would be of value, both 
for its intrinsic interest and for its implications in cryptography; cf. 

In this paper we give an explicit polynomial bound on \E(k)t m \ depending on d 
and the Szpiro ratio a, a certain quantity associated to the elliptic curve E/k, which 
we will now define. Recall that the conductor f^/fe an d the minimal discriminant 
A.E/k are certain integral ideals of Ok that are supported on the primes at which 
E/k has bad reduction. The Szpiro ratio is given by 

m = log|N fc/Q (A g/fc )| 

1 ' ° log|N fc/Q (f B/fc )| 

when E/k has at least one place of bad reduction; by convention we put a = 1 if 
E/k has everywhere good reduction. 

Theorem 1. Let k be a number field of degree d = [k : Q], and let E/k be an 
elliptic curve with Szpiro ratio a. Then 

(2) \E(k) tor \ < Cl da 2 \og{c 2 da 2 ), 

where c x = 134861 and c 2 = 104613. 

Strictly speaking J5J is not a uniform bound, in that the right-hand-side depends 
on the elliptic curve E/k. However, the result is perhaps more interesting in view 
of the fact that one generally expects a to be small. More precisely, let £(fc) denote 
the set (with multiplicities) of Szpiro ratios a of all elliptic curves E/k. A well- 
known conjecture of Szpiro |16| asserts that is bounded, and that 6 is its 
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largest limit point. (As pointed out by Masser [Hi], 6 is in fact a limit point.) An 
analogue of Szpiro's conjecture is known to hold in the function field case, and in 
the number field case it can be shown to follow from the ABC conjecture. 

Note that if Szpiro's conjecture is true, then Theorem ^ gives a uniform bound 
on |£(/c)tor| in terms of k only. The first result showing that Szpiro's conjecture 
implies such a uniform bound is due to Frey; the argument was written up by 
Flexor-Oesterle 0). Also in 0], the authors treated the special case of everywhere 
good reduction (a = 1), giving a bound on |-E(fc) tor | that is exponential in d. In 
7 Hindry-Silverman improved this to a bound of 0(d\ogd)\ thus our bound 
recovers theirs (with slightly different constants) in this special case. In the general 
case, Hindry-Silverman |5] have given a bound on |i?(fc) t or| that is exponential in 
both d and a. Thus the main interest in our bound J2J) is in its explicit nature, and 
in the fact that it exhibits only polynomial growth in both d and a. 

Finally, let us denote by the union of the sets £(fc) over all number fields k 
of degree d = [k : Q] . In view of Merel's Theorem and Theorem it is natural 
to ask whether supS^ and limsupE^ are finite, and if so, how they depend on d. 
Upper bounds on these quantities would amount to a significant strengthening of 
Szpiro's conjecture, however, and would therefore seem to lie well beyond the scope 
of present techniques. 

A problem somewhat related to counting rational torsion points is that of giving 
a lower bound on the Neron-Tate canonical height h(P), for non-torsion rational 
points P, depending explicitly on the relevant data associated to the elliptic curve 
E/k. In particular, a conjecture of Lang asserts that if P £ E(k) is not a torsion 
point, then 

(3) h(P)>clog\N k/Q (A E/k )\, 

where c = c(k) > is a constant depending only on k. We show that one can take 
for c a certain expression depending explicitly on d and a. 

Theorem 2. Let k be a number field of degree d = [k : Q], and let E/k be an 
elliptic curve with minimal discriminant &E/k an d Szpiro ratio a. Then 

(4) h(P)>c(d,a)log\N k/q (A E/k )\ 
for all non-torsion points P £ E(k), where 

(5) ^ ^ = 10 15 d 3 a 6 log 2 (c 2 rfa 2 ) ' 
and c 2 = 104613. 

A consequence of Theorem|21is that Szpiro's conjecture implies Lang's conjecture; 
this fact was originally proved by Hindry-Silverman [S| , who showed that holds 
with a value of c depending exponentially on d and a. Thus again the main interest 
in Q is in the fact that c{d, a) exhibits only polynomial decay in d and a. Compare 
with the results of David 3 , who uses methods from transcendence theory to obtain 
a similar bound. 

Let us now briefly summarize our approach, in which we extend the methods of 
Hindry-Silverman [Jj to include a treatment of the places of bad reduction. Denote 
by Mk the set of all places of k, and given v £ Mk, let k v be the completion 
of k at v. Let h : E(k) — > [0, +00) denote the Neron-Tate canonical height, and 
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recall that given a point P E E(k) \ {O}, we have the local decomposition h(P) = 
Y^, v eM d^v(P)i where d v — [k v : Q v ] is the local degree and A„ : E(k v )\{0} — > K 
is the appropriately normalized Neron local height function (cf. 15. , §VL1.) Given 
a set Z = {Pi, . . . , Ppj} C E(k) of N distinct small rational points, we estimate the 
height-discriminant sum 

(6) h{Z) = jp E h (Pi~Pj), 

l<iJ<N 

from above globally via the parallelogram law, and from below locally using the 
decomposition A(Z) = YIveM ^f^{Z), where 

(7) ^i Z ) = ^ E MPi-Pj)- 

l<i,j<N 

In order to obtain the necessary archimedean estimates we follow [7], using the 
pigeonhole principle to pass to a subset of Z of positive density, all of whose points 
are close to each other in E{k v ) at a particular archimedean place v. At the non- 
archimedean places we give an analytic lower bound on A„(Z) in terms of the 
valuation of the minimal discriminant (cf. Lemma [3] below). Although we will not 
specifically require this interpretation in the present paper, this inequality can be 
viewed as a quantitative form of a local equidistribution principle for small points, as 
developed in pQ. Finally, it is worth noting that we do not decompose the minimal 
discriminant into "small power" and "large power" parts as Hindry- Silverman do 
in Instead, we assemble the local information at the different places of bad 
reduction into global information by a simple application of Jensen's inequality. 

The author would like to express his gratitude to Matt Baker for his many 
valuable suggestions concerning this paper. 

2. Elliptic Curves over Number Fields 

In this section we fix some notation, and we review some of the relevant facts 
concerning elliptic curves over number fields. 

Let Ok be the ring of integers of a number field k, and denote by and 
the set of all archimedean and non-archimedean places of k respectively. For each 
place v 6 A4k we will select the normalized absolute value | |„ that, when restricted 
to Q, coincides with one of the usual archimedean or p-adic absolute values. Given 
a place v S lying over the rational prime p, let O v and M. v be the ring of 
integers and maximal ideal in k v respectively. Let p v = Ok H A4 V denote the prime 
ideal of Ok corresponding to the place v, and let tt v £ A4 V be a uniformizer. Thus 
|N/c/q(P„)| = \O v /A4 v \ — and 1^1^ =p~ 1 / e ", where f v and e v are the residual 
degree and ramification index, respectively. Recall that d v = [k v : Q p ] = e v f v . 

Let E/k be an elliptic curve, and let je denote its j-invariant. The conductor 
and minimal discriminant of E/k are the integral ideals 

(8) U/k= II P^, andA £/fc = [] p£ 

veM° k veM° 

of O/c, respectively, where the exponents rj v and S v are given as follows. Fix a place 
v G and define 5 V = ord„(A„), where A.„ 6 O v denotes the discriminant of 

a minimal Weierstrass equation for E/k v ; thus |A„|„ = 7r^|^". The exponent r\ v 
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of the conductor has a rather complicated Galois-theoretic definition (cf. ^H], § 
IV. 10); alternatively, it can be characterized by Ogg's formula 

(9) 6 V — rj v + m v - 1, 

where m v is the number of components on the special fiber of the minimal proper 
regular model of E/k v (cf. § IV. 11). An immediate consequence of @ is that 
Vv < $v, from which we deduce the lower bound a > 1 on the Szpiro ratio. 

Let Eo(k v ) denote the subgroup of E(k v ) consisting of those points whose re- 
duction (with respect to a minimal Weierstrass equation for E/k v ) is non-singular, 
and let 

(10) c v = \E(k v )/E {k v )\ 

be the cardinality of the quotient. It is well known that the set Eo(k v ), and thus also 
the number c v , do not depend on the choice of the minimal Weierstrass equation 
used to define them. 

Finally, we recall that the various data discussed above are governed by the 
reduction type of E/k v . To be precise, if E/k v has good reduction then S v = rj v = 
and c v = 1 ; if E/k v has split multiplicative reduction then r] v = 1 and c v = S v ; and 
finally, in all other cases we have c„ < 4. For proofs of these assertions cf. JS], § 
IV.9. 

Following Rumely 13 , it will be useful to decompose the non-archimedean Neron 
local height function A„ : E(k v ) \ {O} — > K into a sum 

(11) \ v (P-Q)=i v (P,Q)+j v (P,Q), 

where i v is a nonnegative arithmetic intersection term, and j v is bounded (cf. also 
PP). These functions are most naturally described by considering the cases of 
integral and non-integral j-invariant separately: 

Case 1: \je\v < 1- Then j v is identically zero and X V (P — Q) = i v (P, Q). 

Case 2: \js\v > 1- By Tate's uniformization theory JZj we have maps 

(12) E(k v ) ^%fc„7g z ^R/Z, 

where q G k* with \q\ v = \1/je\v < 1; here the first map is the Tate isomorphism, 
and the second map is given by u ^ log \u\ v / log \q\ v . Let r denote the composition 
of the two maps in 1)12(1. Then 

Jv (P,Q) - ±B 2 (r(P - Q))log\j E \ v , 

where 

1, 1 1 1 



_ 27timt 

2 y " l ~ u ' 12 2tt 2 ^ m 2 

m£Z\{0} 

is the periodic second Bernoulli polynomial. If r(P) ^ r(Q) then i v (P, Q) — 0. On 
the other hand if r(P) = r(Q), then we can select representatives u(P), u(Q) £ 
k*/q z for P and Q under the Tate isomorphism such that |u(P)| v = \u(Q)\ v ; in 
this case 

i v (P,Q) = - log \\-u(P)/u{Q)\ v . 

Finally, we note for future reference that Eo(k v ) = ker(r); in other words, a point 
P E E(k v ) has non-singular reduction if and only if r(P) = 0. 
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3. A Few Preliminary Lemmas 

The following lower bound on the non-archimedean local sum (JJJ is a variant of 
lU, Prop. 1.2. To be precise, our bound (|13[) is given in terms of the valuation of 
the minimal discriminant, rather than the valuation of the ^-invariant as in jo], a 
distinction that is only relevant at the places of additive reduction. 

Lemma 3. Let E/k be an elliptic curve, let v be a non-archimedean place of k, 
and let A v € O v be the discriminant of a minimal Weierstrass equation for E/k v . 
If Z C E(k v ) is a set of N distinct k v -rational points, then 

(13) K{Z)> f^-I)llog|l/A |„. 



c 7 , 

Proof. First, note that 

(14) log + |j B |,, <log|l/A„|„, 

with equality if and only if E/k v has good or multiplicative reduction (cf. |14| . 
Prop. VII.5.1). Also, the Neron local height function satisfies the lower bound 
A„(P) > j2 log|l/A4 u for points P e E (k v ) \ {O} (cf. 15 , Thm. VI.4.1). It 
follows from this and the decomposition (|llfl that 

(15) iv(P,Q) > ^(logll/A^U - log+ li^U) > 
for P — Q 6 Eo(k v ) \ {O}. As i v is nonnegative, it follows that 

(16) Yl iv(Pi,Pj)>(M-N)^(log\l/A v \ v -log + \j E \ v ), 

l<i,j<N 

where M is the number of ordered pairs (Pi, Pj) G Z x Z with Pi — Pj 6 Eo(k v ). 
For each coset C e E(k v )/E (k v ), let N c = \Z n C\. Then 

M= £ 1 

l<i,j<JV 

Pi-Pj£E (k v ) 



(") - E {( 



E ^ 

C(EB((:„)/-E|,(*») 

AT\2 2N C N N 2 ' 



r2 



>- E E 1 



CeE(k v )/E (k v ) v CG-E(fc„)/-E (fe„) 

Combining and (|17|) we find that 

( 18 ) ^ E ^(^,^)> (^-^)^(log|l/A„|„-log+|j £ |„). 

l<i,j<N ' "~ 

i¥=3 
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We will now show that 



l<i,j<N 

i¥=3 



For if \je\v < 1 then both sides of l|19[l are zero. On the other hand, assume 
that \je\v > 1- Since the map r is trivial on the identity component Eo(k v ), it 
is well-defined on the quotient E(k v )/Eo(k v ), which has cardinality c v . Therefore 
r(E(k v )) C (1/cy) C R/Z, and we conclude that e Mmr( ~ p ^ = 1 for all Pj e Z C 
E(k v ), whenever c | m. It follows that 



l<i,3<N l<i,j<N 



log U.E |t 



_« _j_| p 2-Kimr(Pj) 

' 1 m2 / j 



e 



47r 2 ^-^ m 

(20) mGZ\{0} 1<]<N 

N 2 \og\j E \ v ^ 1 N 



2 i- i 
- y^ 1o sI^I 



^ 2 logfek JV, ,. , 
= 12cg -i2 lo g^l"» 

which implies l|19[l. 

Finally, by the decomposition 1|11|) and the lower bounds (|18() and l|19[) . and using 
the inequality (|14|l . we have 



A "( Z ) = ^2 E (i»(^.^) + i«(^.-Pi)) 

l<i,j<N 









> 




1 






> 




1 












= 







i tog 11/A.I. + (1-1)110811/^1, 



which completes the proof. □ 

The following archimedean analogue of Lemma is a quantitative refinement of 
a result due to Elkies (cf. §VL); for a detailed proof see pp, Appendix A. 

Lemma 4. Let E/k be an elliptic curve with j-invariant je, and let Z C E{k) be 
a set of N distinct k- rational points. If v is an archimedean place ofk, then 

(21) K{Z) > - — io g + i^i. - il. 

In order to obtain the necessary non-archimedean estimates we will require the 
following lemma due to Hindry-Silverman (cf. Lemma 1 of [7] and Prop. 2.3 of 
0.) Let j : H = {t e C | 3(t) > 0} — > C denote the modular j-function, and let 
L = Z + rZ be a normalized lattice with rel; thus j( T ) is the j-invariant of the 
elliptic curve C/L. Let A : (C/L) \ {0} — * R denote the Neron function, as given in 

na, §vi.3. 
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Lemma 5 (Hindry-Silverman). If z = n + r 2 T £ C \ {0}, where r\,r-z G K and 
max{|ri|, |r 2 |} < 1/24, then 

K z ) > 2^gmax{l,log|j(r)|}. 

Finally, we will require the following inequality, the proof of which is elementary 
and given in £Q, § 6. 

Lemma 6. Let N > 1 satisfy the bound N < A log N + B for constants A > 0, B > 
0. Then N <{-^ l )(A\ogA + B). 

4. A Bound on the Number of Small Points 

In this section we will prove the results stated in the introduction, which are 
both consequences of the following bound on the number of small rational points. 

Proposition 7. Let k a number field of degree d — [k : Q], and let E/k be an 

elliptic curve with Szpiro ratio a. Then 

(22) \{P G E(k) | h(P) < 1 ° Sl ^3 / 3^ /fc)l }| < c^logMa 2 ), 
where ci = 134861 and c 2 = 104613. 

Proof. To ease notation we will henceforth suppress the subscripts on the notations 
Nfc/Q, &E/k: an< 4 fe/fc- Let S denote the set on the left-hand-side of J22J, and let 
N denote the largest integer satisfying |5| > 24 2 (iV - 1) + 1; thus |5| < 24 2 iV. We 
will show that 

(23) N < (148dcr 2 ) log N + 971dcr 2 . 
Assuming for now that this holds, it follows by Lemma that 

N < (— (U8da 2 log(148dcr 2 ) + 971da 2 ) 

—) U8da 2 log(148e 971 / 148 da 2 ). 
.e — 1 / 

In view of this and the fact that \S\ < 24 2 iV, the bound (1221 follows immediately. 

It now remains only to prove (|23|l . Let i>o be an archimedean place of k, chosen 
so that \je\vo — m&x-v\oo{\3 e\v) ■ We then have a corresponding embedding a : 
E{k) C/L, where L = Z + rZ is a normalized lattice, and |j( T )l = |j'b|« . 
If A : (C/L) \ {0} — -> M denotes the Neron function on the complex torus, then 
A Uo = A o a. 

Divide the torus C/L into the 24 2 parallelograms 

d / , mi-l mi m 2 - 1 m 2 \ 

Prm m-> = \Z = T\ + ToT < Ti < and < To < >, 

mu 2 I 24 ~ 24 24 ~ 24 J 

where 1 < mi,m2 < 24. By the pigeonhole principle there exists a set Z = 
{Pi, . . . , Pat} C S of N distinct points such that a(Z) is contained in one of the 24 2 
parallelograms. In particular, it follows that the difference c(P,) — cr(Pj) between 
any two points in <t(Z) must lie in one of the four parallelograms P mi>m2 , mi, m 2 G 
{1,24}. Therefore, by Lemma|5]we have A.„ (Pj — Pj) > ^ max{l, log |i_B|t> } f° r 
all such pairs. 

It follows from the above considerations that 

(24) A VQ (Z) > max{l,log|j E |„ }; 
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and by Lemma 01 and the maximality of |jb| Uo , we have 

5N J 



EgL , v-^ rfu flogiV 1 , , . 16 i 



(25) 

log AT 1 + 16 
-"^v""l2AT l0g l]E ^~5N- 
If N < 24c?, then (|23|1 plainly holds; so we may henceforth assume that N > 24d+l. 
Then combining the estimates (12411 and (|25|) we have 

(26) 

> — AJZ) > + — <^ i ^max{l,log j E \v n } 



logiV 



> 



16 1 r AT - 1 li 
5iV + 771 288d ~ 12 J 



2N 

logiV 197 (iV-l) 
2~/V~ ~ 60iV + 288oW ' 

We now turn to the lower bounds on A V (Z) at the non-archimedean places; in 
particular we will show that 

<m £ t^ (z) -t^(t^-j^) 1os|n(A)I ' 

First, if -E/A; has everywhere good reduction then l|27|) plainly holds, since in that 
case log |N(A)| = and the left-hand-side is nonnegative by Lemma|3 So we may 
assume that the set M.^ = {»£ \ E/k v has bad reduction} is nonempty. 

At this point we will require Jensen's inequality: if {w v } is a finite set of positive 
weights with w v = W, if x v > for all v, and if 4>(x) is a convex function for 
x > 0, then 

(28) £ w v<t>{xv) > W ${w £ WvXv 

V V 

Applying this with <fi(x) — 1/x, with {w v — r] v log\N(p v )\ \ v G M^ 1 '} as our 
weights (thus W = log |N(f)|), and with x v = r) v c%/5 v , we have 



log|N(p.)| > £ ^log|N( Pl ,)|) _1 

(29) >W 2 ( £ 16^1og|N(p„)|) _1 

(logN(f)) 2 
161og|N(A)|' 

The first inequality in (|29|) is Jensen's, and the second follows from the inequality 

2 2 

(30) 5p < 165 v 

Ov 

for u <E A'i^. (To see this, recall that if E/k v has split multiplicative reduction 
then 5 V = c v and r\ v = 1, and (|3(J[1 holds; while otherwise 77,, < and c„ < 4, and 
(|3UII follows in this case as well.) 
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Finally, combining (fT^|l and (|2*^|l and noting that d v log ll/A^I^ = S v log |N(p„)|, 
we have 



d v ' ~ ^ d V c 2 A/7 12 

= ib E (~^)Mog| NW | 

■u(EA4£ r 

>-Lr"° e w»" 2 .-ii,g|N(A)i) 



12dV161og|N(A)| TV 
1,1 1 



12d\16fj 2 AT 



log|N(A)|, 



which is l(77|) . 



We are now ready to combine these local estimates. By the parallelogram law 
we have the global upper bound 



4 N 



K{z)<-Y,KPi) 

(31) 

log|N(A)| 



< 



2 11 3da 2 



on the sum defined in JSJl, by the upper bound on h(P) for P <E S. Therefore by 
(|2l)jl and fgfi we have 

log|N(A)|^ A(z) 



2 n 3dCT 2 

(32) = E d i^{Z) 



logiV 197 (AT-1) 1/1 h, ,^ T/AS , 

> h H o log N A) . 

2N 60AT 288d7V 12dV16cr 2 AT/ 61 v n 

If -E/A; has everywhere good reduction then, then a — 1 and log |N(A)| = 0, and 
thus (|32|) becomes 

(33) log AT 197 (AT-1) 

V ' ~ 2N 60N 288dN ' 

It follows that AT < 144dlogAf + 945. 6d+ 1 < 144dlogAf + 946.6d, and holds 
in this case. On the other hand, suppose that A4)l r is nonempty. If A^ < 2 9 er 2 , then 
(I23|l holds, so we may assume that N > 2 9 er 2 . The bound implies that 

0.-^-S + ^f^^)lo g |N(A), ^1^)1 



(34) 



2N 60N 12dV16cr 2 NJ 6,1 v " 2 n 3der 2 
\ogN 197 1/1 1 1 \ . / a \ i 

--^-^ + T2dfe-2^-2^J l0g|N(A)l 
log AT 197 5 , 

log AT 197 5 log 2 



> 



2N 60N 2 w da 2 ' 
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since log |N(A)| > log 2. We deduce that 




< (148Ar 2 )logiV + 971cfcr 2 . 



Thus (|23l) holds in this case as well, and the proof of Proposition[7]is complete. □ 

Proof of Theorems^ and\^ Again, let S denote the set on the left-hand-side of 
l)22[l. Theorem ^ follows trivially from Proposition since E(k)tor C S. To see 
that Theorem [3 follows, let P € E(k) be a non-torsion point, and let M be the 
largest integer such that h((M-l)P) < (log \N k/q (A E/k )\)/2 13 3do- 2 . Then the first 
M multiples O, P, 2P, . . . , (M — 1)P of P are contained in the set S, and therefore 
M < cida 2 log(c2C?cr 2 ) by Proposition [7| On the other hand, by the maximality of 
M we have M 2 h(P) = h(MP) > (log \N k/Q (A E/k )\)/2 13 3da 2 . Therefore 
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2 13 3da 2 
> c(d,a)log\N k/Q {A E/k )\, 



where c(d,a) is given by 
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